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£ — ' Abstract. Using isometric embedding of metric trees into Banach spaces, this paper will in- 

vestigate barycenters, type and cotype, and various measures of compactness of metric trees. 
A metric tree (T, d) is a metric space such that between any two of its points there is an 
unique arc that is isometric to an interval in R. We begin our investigation by examining iso- 



'. ' metric embeddings of metric trees into Banach spaces. We then investigate the possible images 



xo — 7r((xi + . . . + x n )/n), where n is a contractive retraction from the ambient Banach space 
X onto T (such a it always exists) in order to understand the "metric" barycenter of a family 
of points xi, . . . ,X n in a tree T. Further, we consider the metric properties of trees such as their 
type and cotype. We identify various measures of compactness of metric trees (their covering 
K^ , numbers, e-entropy and Kolmogorov widths) and the connections between them. Additionally, 

l-H ' we prove that the limit of the sequence of Kolmogorov widths of a metric tree is equal to its ball 

measure of non-compactness. 

1. Introduction. The study of injective envelopes of metric spaces, also known as met- 
ric trees, (T-theory or R-trees) began with J. Tits [52] in 1977 and since then, applications 
have been found within many fields of mathematics. For an overview of geometry, topol- 
ogy, and group theory applications, consult Bestvina [9]. For a complete discussion of 
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these spaces and their relation to global NPC spaces we refer to [13]. Applications of 
metric trees in biology and medicine involve phylogenetic trees [49], and in computer 
science involve string matching [5]. Universal properties of a t\ trees" (this is a special 
class of separable metric trees) in the family of separable complete metrically convex met- 
ric spaces have been discovered, and used to investigate Lipschitz quotients of Banach 
spaces, in [31]. 

Since metric trees are described by three different names and several definitions, we 
start with some definition we will use. The first three are classical, and can be found, for 
instance, in [10] or [25]. 

Definition 1.1. Let x, y <£ M, where (M, d) is a metric space. A geodesic segment from 
x to y (or a metric segment, denoted by [#,&/]) is the image of an isometric embedding 
a : [a, b] — ¥ M such that a(a) — x and a(b) — y. A metric space is called geodesic if any 
two points can be connected by a metric segment. 

Definition 1.2. A metric space (M, d), is a metric tree if and only if for all x,y,z e M, 
the following holds: 

1. there exists a unique metric segment from x to y, and 

2. [x, z] n [z, y] = {z} =4- [x, z] U [z, y] = [x, y]. 

Definition 1.3. (Metric convexity) Suppose T is a metric tree. A subset S C T is called 
metrically convex if S contains the metric segment connecting any two of its points. The 
metric convex hull of S (the smallest metric convex set containing S) is denoted by conS. 

Definition 1.4. Suppose T is a metric tree. Following [2], we call a t € T a leaf (or a 
final point) of T if t £ [x, y] (with x, y G T) implies t 6 {x, y}. 

Below are some examples of metric trees. 

Example 1.5. (The Real Tree) Let Ar denote the set of all bounded subsets of R which 
contain their infimum. For all subsets x and y in R, define a map d : Ar x Xr — > R by 

d(x, y) := 2max{sup(a;Aj/), inf x, inf y} — (inf x + inf y) 

where by xAy we mean the symmetric difference of the sets x and y. Then d is a metric 
on Ar, and (Ar, d) is a metric tree. For striking properties of this metric tree we refer to 
[19]. 

Example 1.6. (The Radial Metric, Spider Tree) Define d:l 2 xM 2 ^ R >0 by: 

y || if x = Xy for some AgK, 




d(x,y) = , 

\y\\ otherwise. 

We can observe the d is in fact a metric and that (R 2 , d) is a metric tree. 

Example 1.7. (Finitely generated trees) Suppose T is a weighted graph-theoretical tree, 
whose sets of vertices and edges arc denoted by V and £ , respectively. Let d e denote 
the length (weight) of the edge e. We construct the metric tree T, generated by T, as a 
union of elementary segments [ui,?^], where V\,V2 G V are adjacent. In this case, [vijVa] 
is identified with the edge e, connecting v\ and t>2, and d{v\,V2) = d e . The definition of 
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the distance d is then extended to T 2 in the obvious way. It is easy to see that all the 
tree axioms are satisfied. 

Example 1.8. (The spider with n legs) In many of our examples, we shall consider a 
subtree of the radial tree described above. Fix n £ N, and a sequence of positive numbers 
(°i)r=i! the spider with n legs is defined as a union of n intervals of lengths 01,... ,a n , 
emanating from the common center, and equipped with the radial metric. More precisely, 
our tree T consists of its center o, and the points (z,t), with 1 ^ i ^ n and < t ^ a.;. 
The distance d is defined by setting d(o, (i, t)) = t, and 

|*-s| i = j 



d((i,t),(j,s))- { t + s Mj 

Abusing the notation slightly, we often identify o with (i, 0). 

We can consider this tree as a finitely generated tree (see Example 1.7), arising from a 
"spider-like" graph, with vertices Vq, Vi . . . , v n , and edges of length 1 connecting vq with 

Vi,...,V n . 

The simplest spider - that with three legs - is called a tripod (this terminology comes 
from [50]). 

Example 1.9. (Non-simplicial Tree) In general metric trees are more complicated than 
metric graphs. Metric graphs are spaces obtained by taking connected graphs and metriz- 
ing nontrivial edges. Such a graph is a metric tree if the corresponding metric graph is 
connected and simply connected. For example, consider the set [0, oo) x [0, oo) with the 
distance d : [0, oo) x [0, oo) — > M^o given by: 

m x xfl^i-J/il if £ 2 =2/2, 

\xi + J/1 + \x 2 — 2/2 | if X 2 7^t/2. 

Set X n = (H™, — d), where H n is a hyperbolic n-space. Then the ultraproduct Y[ X n over 
some nontrivial ultrafilter U is the asymptotic cone H£J of H™, an example of a non- 
simplicial tree. In this metric tree, the complement of every point has infinitely many 
connected components. For further discussion of this space and construction of metric 
trees related to the asymptotic geometry of hyperbolic metric spaces we refer to [13] and 
[23]. 

We refer the reader to [10] for the properties of metric segments and to [2], [3] and 
[25] for the basic properties of complete metric trees. Below we list some useful notation 
and results. 

For x, y in a metric space M, we sometimes write xy = d(x, y). For x,y,z € M, we 
say y is between x and z, denoted xyz, if and only if xz = xy + yz. The following facts 
will be used throughout the paper: 

1. (Transitivity of betweenness [10]) Let M be a metric space and let a, b,c,d£ M. If 
abc and acd, then abd and bed. 

2. (Three point property [2], [25, Section 3.3.1]) Let x, y, z £ T (T is a complete metric 
tree). There exists (necessarily unique) w £ T such that 

[a;, z] n [y, z] = [if, z] and [x, y] C\ [w, z] = {w} 
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Consequently, 

[x,y] = [x,w] U [w, y] , [x,z] = [x,w] U [w, z], and [y,z] = [y,w] U [w, z] . 
3. (Compactness, [2]) A metric tree T is compact if and only if 

T = I J [a, /] for all a <E T and i* 1 is compact , 

/e-F 

where i* 1 is the set of leaves of T. 

We also need to mention several properties of metric spaces. 

Definition 1.10. A metric space (X,d) is said to be -hyperbolic (or to satisfy the 
four-point inequality) [19, 25] if, for any xi,x 2 ,X3, X4 in X, 

d(x\ 1 x 2 ) + d(x 3 ,X4) < ma,x{d(xi 7 x 3 ) + d(x 2 ,X4),d(xi,X4) + d(x 2 ,x 3 )}. 

X is said to satisfy Reshetnyak's inequality [50] if, for any Xi,x 2 ,x 3 ,X4 G X, 

d(x 1 ,x 2 ) 2 + d(xz,X4) 2 < d(xi,x s ) 2 + d(x 2 ,X4) 2 + d(xi,X4) 2 + d(x 2 ,x 3 ) 2 . 

It was proven in [18] (see also [25, Chapter 3]) that any 0-hypcrbolic metric space 
embeds isometrically into a metric tree. Moreover, a metric space M is a metric tree if 
and only if it is 0-hyperbolic and geodesic. We see below (Lemma 5.1) that the four-point 
inequality implies Reshetnyak's inequality. The converse is not true, as an example of a 
Hilbert space shows. 

Further properties of metric spaces are encoded in the definition below. 

Definition 1.11. A geodesic metric space X is called a CAT(O) space or a global met- 
ric space of non-positive curvature (NPC), (global NPC space) if for every three points 
Xq, X\, x 2 € X , the CN Inequality holds: 

2 ^(.To,^!) 2 d(x ,x 2 ) 2 d(x u x 2 f 
d{xo,y) < + 

whenever y is the midpoint of a metric segment connecting x\ and x 2 . 

For information on these spaces, the reader is referred to [13], [50], or [43]. In [47], it 
was shown that a geodesic space is a CAT(O) space if and only if it satisfies Reshetnyak's 
inequality. The class of CAT(O) spaces includes metric trees (see Lemma 5.1), as well as 
Hilbert spaces and hyperbolic spaces [27]. 

Generalizing the classical Banach space notion of uniform convexity, we follow [27] in 
defining the modulus of convexity for geodesic metric spaces. 

Definition 1.12. Suppose (M,d) is a geodesic metric space. For numbers R > 0, e <s 
[0, 2R], and a G M, let (Tm{o>, R, e) = inf{l — d(m, a)/R}, where m is the midpoint of a 
metric segment connecting x\ and x 2l and the infimum runs over all pairs (xx,x 2 ) with 
max{d(o, X\),d(a,x 2 )} ^ i?, and d(xi,x 2 ) ^ Re. Define the modulus of convexity of M 
by setting cr M {R, e) = inf ae M cr M (a, R, e). 

The CN inequality implies that, for any CAT(0) space M, 



(T M (R, e) > (T H (R, e) - (T H (e) = 1 - ^1 - ( 2 /4 
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(here, H is the Hilbert space of dimension greater than 1). In Lemma 5.2, we obtain a 
sharper estimate on the moduli of convexity of metric trees. 

2. Hyperconvexity and Metric Trees. 
Definition 2.1. A metric space X is hyperconvex if 

for any collection {B c (xi;ri)}i^i of closed balls in X with XiXj ^ r; + rj. 

The notion of a hyperconvex metric space was introduced by Aronszajn and Panitch- 
pakdi [4]. They proved the following theorem, which is now well known. 

Theorem 2.2 (Aronszajn and Panitchpakdi, [4]). X is a hyperconvex metric space if 
and only if X is a 1 absolute Lipschitz retract; that is, for all metric spaces D, if C C D 
and f : C — > X is a nonexpansive mapping, then f can be extended to the nonexpansive 
mapping f : D —} X. 

Hyperconvex spaces are complete and connected; the simplest example of hyperconvex 
space is the set of real numbers Rora finite dimensional real Banach space endowed with 
the maximum norm. While the Hilbert space £2 fails to be hyperconvex, the spaces £00 
and Loo ar c hyperconvex. The connection between hyperconvex metric spaces and metric 
trees is given in the following theorem: 

Theorem 2.3 ([32], [1]). A complete metric tree T is hyperconvex. Conversely, any hy- 
perconvex space with unique metric segments is a complete metric tree. 

3. Embeddings of Metric Trees into Banach spaces. Henceforth, we consider iso- 
metric embeddings of metric trees into Banach spaces. Note that there is a wealth of 
results concerning Lipschitz embeddings of graphs (including trees) into Banach spaces. 
In particular, the connections between Lipschitz embeddability of trees and superreflex- 
ivity were investigated in [11], and more recently, in [6] (they are discussed in more detail 
in Section 3.3). The distortion necessary to embed a metric tree into a uniformly convex 
Banach space can be found in e.g. [39] (by [30], this problem is equivalent to computing 
the distortion of embedding the corresponding finite tree). 

3.1. Embeddings into Loo. First we consider two embeddings into L m , with different 
properties. 

Theorem 3.1 (From [33], page 395). Let X be a metric space and a £ X , then J = J a : 
X — > £°°{X) : x 1— > (xm — am) m£ x is an isometric embedding of X into £°°(X). 

The embedding J a defined above is called canonical. When the space X is bounded, 
we can also use the embedding J(x)(y) = d(x,y). 

We can also embed a metric space X into a larger Loo space. To this end, pick to e X, 
and denote by Cx.t a = £ the space of 1-Lipschitz functions from X to K, vanishing at t$. 
Define the universal embedding of X into £oo(£) by setting, U(t) — (/(t))/ S £ for t G X . 
Below we show that U is indeed an isometric embedding, satisfying a certain "universal 
projective" property. 
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Theorem 3.2. 1. The map U described above is an isometry. 

2. For any 1-Lipschitz function g : X — > R, there exists a 1-Lipschitz affine functional 
g : ^oo(£) — ► R, such that g = g o U. 

3. For any 1-Lipschitz function g : X — > Z , where Z is a X-injective Banach space, 
there exists a X-Lipschitz affine map g : £oc{C) —> Z, such that g = g o U. 

Proof. (1) Fix i,j/Gl, and show that ||£/(a;) — U(y)\\ = xy. As any / G C is 1-Lipschitz, 
the definition of U yields 

\\U(x)-U(y)\\ = sup\f(x)-f(y)\^xy. 

fee 

To prove the reverse inequality, consider the function f x : X — > R : 1 1— > xt — xto • Clearly, 
f x E C, hence \\U{x) - U(y)\\ > \U(x) - f x (y)\ = xy. Thus, U is an isometry. 

(2) By translation, we can assume that g(to) = 0, hence g G C. Set g(a) = a g for 
a = (af)fec € ^oo(£), Then g is a contractive linear functional. Moreover, for any id, 
g(U(x)) = (U(x)) g = g(x), as desired. 

(3) Fix an isometric embedding I : Z — > l^ (r). Let P : ^oo(L) — > Z be a projection 
of norm not exceeding A. We can view I o g as a collection of maps h 1 : X — > R (7 <G F). 
By Part (2), each of them admits a 1-Lipschitz extension /i 7 . This results in a 1-Lipschitz 
map h = (h 7 ) : <?oo(£) — > ^oo(r), extending / o p. We complete the proof by setting 
g = P o h. m 

Note that the canonical embedding of T into £oo (T) need not share this property of 
the universal embedding. Indeed, suppose T = [0,1]. Consider the function g : T — >• R, 
defined by setting g(0) = 0,g(l/n) = 0, ,g((2n+ l)/(2ra(n+ 1))) = l/(2n(n + l)) (n€ N), 
and letting q be linear on each interval [— W, „ 2 ," +1 n 1 and [ 2 ," +1 n , i]. We claim that 

& ^ L n+1' 2n(n+l) J L 2n(n+1)' n J 

there is no affine bounded map g : Hoo{T) — > R such that g o J = g 1 where J is the 
canonical embedding. To show this, recall the definition of J (with x* = 0): for x, y G T, 

x -2y y < x 



h x (y) = J(x)(y) = |x-y|-|y| 

For neN, set a„ = (l/n+l/(?i + l))/2 = (2n+l)/(2n(n+l)), &„ = (l/n-l/(n+l))/2 = 
l/(2n(ra + 1)), F„ = /i 1/n , and G rl = ft 0n . By definition, g(F„) = 0, and g(G n ) = b n . 
Furthermore, ho = 0, and g(ho) = 0. Therefore, the extension g : £oo(T) —> R, if it exists, 
is a linear functional. 

Let H n = F n+ \ + F n — 2G n . A simple computation shows that 

r 2(l/7i - y) a n < y < 1/n 

ff„(l/) = ^ 2(y-l/(n + l)) l/(n+l)^^«„ , 
[ otherwise 

hence \\H n \\ = sup. y |i?„(y)| = 26„. By linearity, g(H n ) = -2b n . 

For N € N, let H = E^=i( 2 M _1 #n- Then ||#|| = 1, and g(H) = -N. As N is 
arbitrary, there is no g with the desired properties. 

Note also that there need not be an "injective" counterpart of the "projective" univer- 
sal embedding U. More precisely, suppose T is the "tripod" tree, described in Example 1.8. 
There is no isometric embedding A : T — » X with the property that, for any isometric 
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embedding B : T — >• Y (X and Y are Banach spaces), there exists a contractive affinc 
map V : Y — > X satisfying V o B — A. Indeed, suppose, for the sake of contradiction, 
that there exists an A with this property. Consider B\ :T — >• l^, taking (l,i) to i(l, 1), 
(2, t) to i(l, — 1), and (3,t) to — t(l,— 1). We can assume that A(o) = (as before, o 
denotes the "root" of T). Suppose V 1 oB 1 = A, for some Vi. Then A(2, 1) = -A(3, 1). 
Modifying B\ to obtain the "right" B% and S3, we show that A(l, 1) = —^4(3, 1), and 
A(l, 1) = —A(2, 1). But these three equalities cannot hold simultaneously. 

3.2. Embeddings into L\. Next we define the "semicanonical" embedding of T into a 
space Li(fi), with the measure /j, on T defined below (we follow the construction from 
[26]). For any two points x,y G T, denote by <f> xy the isometry from [0,d(x,y)] to [x,y], 
mapping to x. A set S C T is said to be measurable if 4>~y(S) is a Lebesgue measurable 
subset of [0, d(x,y)] for any x,y G T. By the transitivity of betweenness (Section 1) the 
intersection of two metric segments is either empty, a singleton, or a metric segment, 
hence any metric segment is measurable. 

For an interval [x,y] C T and measurable S C T, we define fi[ x ,y](S) = A(0~ 1 (S')), 
where A is the Lebesgue measure. Denote now by T the set of all finite unions F — 
U£ =1 [xfc,yfc] of disjoint unions of metric segments. For a measurable S C T, and F 
as above, set hf(S) = Y!l=i ^[x k ,y h ]{ s )- Finally, let /J, T (S) = fi(S) = sup FeJ r (jt, F (S). 
It is easy to see that fj, is indeed a measure, vanishing on countable sets, such that 
Mfl^i y\) = d(x, y) for any x,y e T. 

The "semicanonical" embedding {/ = t/ xo of T into L\(jj) (xq is a point in T), is 
defined by U(x) = Xteo,^]- To verify that [/ is isometric, note that, for any x,y G T, 
there exists a unique z€ [xo, x] s.t. [x, y] = [x, z] U [z, y], and [z, y] n [xo, x] = {z}. Then 
U(x) - U(y) = X[a;,«) ~ X[y,z), hence 

||[/(x) - LT(j/)|| = \\x[x,z)\\ + \\X[y,z)\\ = rf(x,z) +d(y,z) = rf(x,y). 

An embedding of a finitely generated tree into if is described into [2- r >, Section 2.5]. 

3.3. A characterization of superreflexivity. Recall that a Banach space X is called su- 
perreflexive if all its ultrapowers are reflexive, or equivalently, any Banach space which 
can be finitely represented in X must be reflexive. The reader is referred to [7] for many 
properties and characterizations of superrcflcxive spaces. 

Theorem 3.3. Suppose X is a non-superreflexive Banach space, T is a finitely generated 
metric tree, and e > 0. Then there exists a Banach space Y , (1 + e) -isomorphic to X , 
such that T embeds into Y isometrically . 

This theorem should be compared with the characterizations of superrcflcxive Banach 
spaces due to F. Baudier and J. Bourgain [G, 11]. Their results concern the binary tree of 
height nT n = U™ =0 { — 1, lp (n ^ 0), and the infinite binary tree Too = U™ {- 1, lp. We 
can view these objects as graphs, where the only edges are those connecting ([a]) with 
([a],±l) (a G { — 1, lp)- The graph structure induces the hyperbolic distance d, defined 
as follows. For a — (ai, . . . , ctfc) G { — 1, l} fe and (3 = (f3i, ...,f3i) G { — 1, 1} , denote by 
s = s(a,0) the smallest integer j for which aij+i 7^ Pj+i (if cci 7^ /3i, or if either k or £ 
equals 0, set s = 0). Let d(a, /?) = k + £ — 2s. 
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The Lipschitz constant of an embedding f : A —¥ B between metric spaces is defined 
as 

L(/) = SUP — r SUp , , , , .. 

x ± y d A (x,y) x?ydB{f(x),f{y)) 

(here d A and ds are the distances in the spaces A and B, respectively). We say that A 
has a Lipschitz embedding into B if there exists an embedding / : A — > B with finite 
Lipschitz constant. A family (A n ) is said to have a uniform Lipschitz embedding into B 
if there exist embeddings /„ : A n —} B, with sup„ L(/„) < oo. 

J. Bourgain [11] proved that a Banach space X is not supcrrcflcxive if and only if the 
family (T n ) has uniform Lipschitz embedding into X. Recently, F. Baudier [G] established 
that Too Lipschitz embeds into any non-supcrrcflcxivc space. Together with Theorem 3.3, 
these results yield: 

Theorem 3.4. For a Banach space X , the following are equivalent: 

1. X is not superreflexive. 

2. There exists a Lipschitz embedding of T into X. 

3. There exist Lipschitz embeddings f„ : T — > X (n £ N), with sup ra L(/ ra ) < oo. 

4- Any finitely generated metric tree embeds isometrically into an isomorphic copy of 
X. 

Remark 3.5. It is easy to note that a strictly convex space cannot contain a tripod, 
described in Example 1.8 (a Banach space X is called strictly convex if the equality 
2(||:r|| 2 + ||y|| 2 ) = ||:r — y\\ 2 implies x = —y). If X is separable, we can find an injection 
T : X — > £2, and equip X with the equivalent strictly convex norm |||x||| = (||x|| 2 + 
llTrcl) 2 ) 1 ' 2 . Thus, there exist non-supcrrcflcxivc Banach spaces which do not contain the 
tripod isometrically. This shows that rcnorming is essential in Theorems 3.3 and 3.4(4). 

For the proof of Theorem 3.3, we need a simple lemma. 

Lemma 3.6. Suppose a Banach space X is not superreflexive, and c £ (0, 1). Then for 
every n £ N there exists a family (xi)™ =1 in the unit ball of X , such that 

1. If (at) is a sequence of scalars, changing signs at most once, then 

IIE"=i a ^ll > 2c Er=iM- 

2. For every i, dist(xi,span[xj : j 7^ i]) > c. 

Proof. Fix A £ (c, 1). By a Ramsey-style result from [28], there exists m = m(n, c, A) £ N 
with the following property: if (yi)i^i i s a subset of the unit ball of a Banach space X , 
and \\yi — yj\\ ^ 2A whenever i ^ j, then there exist 1 $J s± < S2 < ■ ■ . < s n ^ m, such 
that dist(y 8i , span[y s . : j £ {1, . . . ,rt}\{«}]) > c for any i £ {1, . . . , n}. 

Now suppose X is not superreflexive. By [48] (see also [7, Part 4]), there exist 
j/i, . . . , y m in the unit ball of X , such that, for every 1 ^ k ^ m, \\yi + . . . + y^ — 
j/fc+i — ... — y m \\ > m + X—1. We claim that || J2j=i a jVj\\ ^ ^J2j=i \ a j\ if the sequence 
(cij) changes sign at most once. Indeed, suppose <x,- ^ for j ^ k, and ocj ^ for 
j Jj k + 1. By scaling, we can assume ^ \ctj\ — 1. Then ^ a 3 ■ ^ 1 for 1 Sj j ^ k, and 
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Js dj ^ — 1 for k + 1 ^ j ^ m. By the triangle inequality, 

m m 

\\^2 a jVj\\ > \\yi + ■ • - + yk-yk+i- ■ ■ --ym\\ -^2( l -\a 3 \) >(m + A-l)-m + l = A. 

By our choice of m, we can find the vectors ar*. = y Sfc with the required properties. ■ 

Proof of Theorem 3.3. A weighted tree graph T = (V,£) (V and £ denote the sets of 
vertices and edges, respectively) gives rise to the metric tree T, as in Example 1.7. Select 
«0 G V, and call it the root. Enumerate the immediate descendants of vq (that is, the 
vertices connected to v® by edges) by v±, . . . ,v rlt/j . For 1 ^ i ^ 7i0, let Oj = d(vq),Vi). For 
each «, enumerate its own immediate descendants vu, . . . , i>i ni , and set a^ = d{vi,Vij). 
Proceeding further in the same manner, we write V as the collection of points vs, for 
a finite collection S of finite strings S. Then Vs> is a descendant of i>s if and only if 
S' = S ■—■ j, for some j G [1,71.5]. Set 05/ = d(vs,vs')> where S is the unique immediate 
predecessor of S' . 

For Si,S2 S 5, write Si -< S2 if vs-i is a predecessor of vs 2 , or cquivalently, if 
S 2 = Si — ■ ji — ... — jfc. For S Q = (ii a . . . i kaa ) (a e {1,2}), set S1AS2 = (in •■• ifc i), 
where fco is the largest integer fc with the property that if-i — i-^. If there is no such k, 
set Si A 52 = 0. Then vs 1 ^s 2 i s the largest common predecessor of i^ and vs 2 - 

It is easy to see that the distance d on the set V (inherited from the tree T) is given 
by the formula described below. For S Q = (i\ a . . . ik a a) (a € {1, 2}), let fco be the largest 
integer k such that i^i = 1^2- Let S = Si A S2 = (in . . . ik i)- Then 

fel &2 

d(v Sl ,v S2 ) =d(v Sl ,v s ) +d(v s ,v S2 ) = J^ a (ni ... Qml) + J^ 0W.. Om2 ). (3.1) 

m— /co + 1 rn— fco + 1 

The main step is to renorm X (making it into Y) in such a way that there exists an 
isometry J v : V -> F. We then extend it to J : T -> Y so that J\ v = Jy. For £ e T\£ , 
there exist unique v±,V2 G V such that £ belongs to the elementary segment [v\, 1*2] ■ Let 
A = d(vi,t)/d(vi,V2)- Define J(£) = AJy(«i) + (1 — A)Jy(i>2)- Clearly, J is an isometry 
on any elementary segment. By the description of metric segments given in [10, Lemma 
15.1], J is an isometry on T. 

To construct Jy : V — > Y, denote the cardinality of V by N. By Lemma 3.6, there 
exists, for every M <G N, a family (xiM)fLi C -8(0; 1) such that 

11 J2 a «n >o- + 2 " M )~ 1 E m ( 3 - 2 ) 

for any sequence (a,) with at most one change of signs, and || Xa=i Q^iAfH ^ (1 + 
2 _M ) _1 maxi \a,i\ for any sequence [pn)- 

Introduce the lexicographic order <C on S as follows: if Si < S2, then Si <C S2. 
Otherwise, let S = Si A S2, and write S a = S -^ j\ a ^- . . . ■— j maa (&■ = 1, 2). We say 
Si <C S2 if jn sj ji2- Let <f> : S — > {1, . . . , N} be the monotone increasing bijection with 
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respect to the lexicographic order. Define Jm ■ V — > X by setting, for S = (i\, . . . ik), 

k 

3 = 1 

and JmM = 0, By (3.1) and (3.2), 

d(v Sl ,vs 2 ) > \\Jm{v Si )-Jm(vs 2 )\\ > (l + 2- M r 1 d{v Sl7 v S2 ). 
Passing to a subsequence if necessary, we can assume that 



$(ai,...,ajv) = lim || /^jXjmW 



exists for every sequence {a{)f =1 . For any such sequence, max^ |a»| ^ $(ai, . . . , «at) ^ 
J2i \ a i\- Thus, we can define a normed space Z by setting || ^,. ai€i\\ = $(ai, . . . , a/v), 
where {ei)fL 1 is the canonical basis for M. N . Denote the span of {xiMH—x in X by 
Zm-i and define the map Um '■ Zm —> Z : x%m ^ &i- Find M so large that c = 
max{||[/ A/ ||, ||^-/||} < VTT~e. Consider J = C7 M oJ M :V^Z. As || £\ c^H = E, 



Q; 



if the sequence (ctj) changes sign no more than once, the map J is an isomctry. 

To renorm X, embed Z isometrically into l^. Then there exists U : X — > i^, with 
U\z M = U M , and \\U\\ ^ c. For x G X, define ||x||y = max^Hxll, ||E/x||}. Clearly, 
c _1 ||x|| sC ||x||y ^ c||x||. Moreover, for x G Z A/ , ||i7x|| = ||£/a/x|| ^ c _1 ||x||, hence 
||x||y = ||C/mx||. In other words, Y contains Z isometrically. 

Consider J = Um° Jm :V ^ Z. As \\ J2i a i e i\\ = J2t \ a i\ if the sequence (a,) changes 
sign no more than once, the map J is an isometry. Therefore, the map Jm ■ V — > Y is an 
isomctry. ■ 

4. Barycenters of trees. There have been numerous attempts to find an appropriate 
"non- linear" notion of the barycenter of a set (or of a measure) in a metric space. Several 
possible definitions are discussed in [50]. In this section, we approach this problem for 
metric trees, using their injectivity. More precisely: suppose U is an isometric embed- 
ding of a metric tree T into a Banach space X, equipped with the norm || • ||. Suppose 
Xi, . . . ,x n G T, and let xq = (xi + . . . + x n )/n be their barycenter in X (we identify 
x G T with U(x) <E X). Let P = Pjj.t.x be the set of contractive retractions it from X 
onto U(T) (it is non-empty since T is injective). We try to describe P(xo). More gen- 
erally, suppose a = (ai)?=i i s a sequence of positive numbers, with J2k=i a k = 1- Set 
j(q) _ ^™ =1 auXk, and try to describe P(x^). 

Proposition 4.1. Suppose T is a complete metric tree, embedded isometrically into a 
normed space X . For xq G T and x G X , the following are equivalent: 

1. x GP(x). 

2. For any x G T , d(xo, x) ^ ||x — x|| . 

//, in addition, T is compact, then the two statements above are equivalent to: 

3. For any leaf (final point) y E T , d(xo, y) ^ ||x — y\\ . 

In the proofs below, we sometimes identify T with its image in the ambient Banach 
space, and d(-, •) with || • — • ||. 
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Proof. By the injectivity of T, (1) holds if and only if there exists a contractive map 

7r : T U {£} — > T such that -k\t = It, and 7r(x) = xo. 

This, in turn, is equivalent to (2). Clearly, (2) implies (3). To show that, for a compact 
T, the converse is true, recall the "Krein-Milman Theorem for metric trees" (Statement 
(4) in Section 1 , proved in [2] ) , which asserts that T = I J [xq , y] , where C is the set of 

leaves of T). For x G T, find y G £ such that x G [xo,y]. If ||xo — y\\ ^ \\x — y\\, then 

||x - x|| = ||xo - y\\ - \\x- y\\ < ||x - y\\ - \\x - y\\ < \\x- x\\, 
thus (3) implies (2). ■ 

Corollary 4.2. If then x G P(x( Q )), i/iera d(x ,x) < ^ k a k d(x k ,x) for any x G T. 
Proof. By the Proposition 4.1(2) 

\\xa - x\\ < || ^a k x k - x\\ = || ^a k (x k -x)|| < ^a fc ||x fe - x|| 

for any x G T, whenever tt(x^) = xq. u 

In certain cases, the converse to this corollary is also true: this is shown by the 
following two theorems. However, in general, the converse implication does not hold 
(Example 4.10). 

Theorem 4.3. Suppose T is a complete metric tree, embedded into £oo(T) in the canonical 
way. For Xo G T, the following are equivalent: 

1. x GP(x^). 

2. d(xo,x) ^ J2k a kd(xk,x) for any x G T. 

Proof. The implication (1) =>■ (2) follows from Corollary 4.2. We establish the converse. 
Recall that the canonical embedding takes x G T to h(x) G £oo(T), where h(x)(y) = 
d{x,y) — d(x*,y). Suppose d(xo,x) ^ ^ k a k d{x k ,x) for any x G T. By Proposition 4.1, 
we have to show that d(xo,x) ^ \\x^ — h(x)\\ for any x G T. We identify x^ with the 
function <fi : T — >• R, defined by 

4>{y) =^a k h(x k ){y) =^a k \\x k -y\\ - \\x* - y\\. 

k k 

Then 

||x< Q ) - h(x)\\ = sup \<j>(y) - h(x)(y)\ = sup | 5> fc (||x* - ]/|| - ||x - y||)| 

aeT y£T " 

^ ij^afcdlzfc ~ x ll ~ llz-^IDI = y^a/clkfc -a;|| ^ ||a? — x\\. 

k k 

■ 

Theorem 4.4. Suppose T is a compact metric tree, embedded into Li(fix) i n the semi- 
canonical way. For Xq G T, the following are equivalent: 

1. x G P(x^). 

2. d(xo,x) < J2k a k\\ x k - x\\ for any x G T. 
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Proof. As in Theorem 4.3, we only need to establish (2) => (1). Suppose a; is a leaf of 
the tree T. By Proposition 4.1, we have to show that, if Xo G T is such that d(xo,x) ^ 
^2 k a k d(x k , x), then ||xo — :c|| ^ \\i^ a ' — x\\. The semicanonical embedding of T into 
L\ identifies (€ T with X[x ,t] (by translation, we can identify x with 0). For each k, 
find u k G T satisfying [x ,x fc ] n [x ,x] = [x ,Ufc]- Then y k = x k - u k = X[x k ,u k \ and 
Zfe = a; — u k = X[x,u k ] have disjoint support, hence 

\\x h - x\\ = \\y k - z k \\ = \\y k \\ + \\z k \\ (4.1) 

Furthermore, \\xq — x\\ = \\u k \\ + \\z k \\ . Thus, 

J2^k\\x k ~x\\ =^a fc (||y fc || + ||z fc ||) ^ ^a fc ||x -x|| =^a fe (||w fe || + \\z k \\), 

which is equivalent to 

^Ofc||yfc||>y]aib||u fc ||. (4.2) 



/ J ^-K || &K M f / 

k k 



We have to show that 



\\i {a) x\\ = || $>*(** - x)|| > £ afc (|| Ufc || + ||z fe ||) = ||x||. 

k k 

In view of (4.2) and (4.1), it is enough to prove that, for any leaf a; G T, j J2 k a k{x k — x)\\ = 
J2k ak \\ Xk ~ x \- Thus, it suffices to establish that, at any point y € T, the signs of 
(xk-x)(y) = X[x ,x k ](y)-X[x ,x](v) are independent of k. If y $ [x ,x], then X[x ,x k ](y)- 
X[x ,x](y) > Ofor any k. On the other hand, tfy$ [x ,x], then X[x ,x k ](y) ~X[x ,x](y) < 
for any fc. ■ 

Remark 4.5. The sets {xo G T : d(xo,x) ^ ^ fc a k d{x kl x) Vx G T} were briefly discussed 
in Remark 7.2(iii) of [50]. Namely, consider the probability measure q = ^2 k a k^x k - The 
set of points described above was denoted by C* (q) . 

As shown by the following example, this set need not be contained in the metric or 
linear convex hull of xi, . . . , x„ (sec Definition 1.3 for the definition of metric convexity). 

Example 4.6. As an example, consider the points x 2 ; = (i, 1) (1 ^ i ^ 3) in a spider 
with four legs (defined in Example 1.8). If T is embedded into £oo(T) (respectively L\) 
in the canonical (respectively semicanonical) way, then P(xo) consists of o, as well as of 
all (j,t) with 1 ^ j ^ 4 and t ^ 1/3. In particular, (4,1/3) or rather, its canonical or 
semicanonical image belongs to neither the metric nor linear convex hull of {xi, X2, X3}. 

Certain information about P(xo) may be extracted from the following results. 

Proposition 4.7. Suppose a complete metric tree T is embedded isometrically into a 
normed space X, and x is a point of X . Then P(x) is a closed, metrically convex subset 
ofT. 

Proof. Proposition 4.1 implies that xo G P(x) if and only if d(x,Xo) ^ ||x — x|| for 
any x G T. This implies that P(x) is closed. Furthermore, suppose x l5 X2,G P(x), and 
xo G [xi,X2]. Then, by Section 2 of [50], 

d(x,Xo) ^ max{d(x,xi), d(x, X2)} ^ ||x — x|| 

for any x G T, which implies x G P(x). ■ 
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In certain cases, when the structure of x±, . . . ,x n in T is simple, we can describe 
P(xo) explicitly. For instance, if n = 2, then P(xq) = {to}, where xq € [£1,2:2] satisfies 
d(xi,xo) = d(xi,X2)/2 (equivalently, d(x2,xo) = d(xi,X2)/2). Indeed, 

d(x ,xi) = \\(xi +x 2 )/2 -.ti|| = d(xi,x 2 )/2, 

and similarly, d(x2,xo) — d(x±,X2)/2. If xq <£ [11,0:2], then there exists y € [2:1,2:2] such 
that [xo,x s ] = [xo,y] U [y, x s ] for s = 1,2. Then d(xo,xi) + d{xa,X2) > d(xi,X2), which 
contradicts the contractivcncss of the map taking xq to xq. Thus, xq is the unique point 
of [£1,3:2] satisfying d(xi,xo) = d(x\,X2)j2. 

In a more complex situation, consider the "tripod" T, with limbs of length 1 (described 
in Example 1.8). For < a < /3 < 1, we define (i, [a, f3]) = {(i,t) : a^t^ /3}. 

Theorem 4.8. Consider the points x% = (i, 1) (i = 1, 2, 3) in the tripod T described above. 
Suppose S is a subset ofT. Then there exists an embedding of T into a Banach space X 
such that S = P(xo) if an d only if there exist i$ £ {1, 2, 3} and Sj a $J f$ ^ 1/3, such 
that either (i) S — (io, [a, 0\), or (ii) S = (ig, [0, (3}) U ( U,;^^ (i, [0, a])) . 

Proof. First suppose T is embedded in a normed space X, and show that P(5o) is of the 
form described in the theorem. For 1 $J i SC 3, let di = ||xj — Xo\\- By relabeling, we can 
assume that d\ ^ e?2 ^ d 3 . Note that d 3 ^ 4/3. Indeed, 

Xi+ x 2 + X3 



X3 

= o 11(^3 - Xi) + (%3 - a:2)|| < -(d(x 3 ,xi) + d(x 3 ,x 2 )) = -. 

Furthermore, di + d 2 Js d{x\,X2) = 2, hence in particular, d\ Js 2/3, and <i 2 5^ 1. Let 
/3 = d.2 — 1, and a = |di — 1|. 

By Proposition 4.1, xo = (i,t) <E T belongs to P(xo) if and only if d(xi,Xo) ^ di for 
each i. Thus, xo = (1, t) € P(£o) if and only if two conditions are satisfied: 

1. d(xi, xq) = 1 — t ^ dx, or in other words, t ^ 1 — di, which translates to t Js a or 
t Js 0, depending on whether 1 — di is positive or negative. 

2. d(x2, to) = 1 4- 1 S$ c?2, or in other words, t ^ d 2 — 1 = [3. 

The third condition, d(x 3l xq) = I + t ^ d 3 , is subsumed in the second one, as d 3 Js d%. 
Thus, (1, t) G P(t ) if and only ifl-^<t<l-a. 

Similarly, xq = (2, 4) G P(xo) if and only if two conditions are satisfied: 

1. d(xi,Xo) = 1 + i ^ di, or in other words, £ ^ di — 1, which means either t ^ a 
(d\ ^ 1), or there are no suitable tfs (di < 1). 

2. d(a;2, xq) = I — t ^ d2, which is always true, since c?2 Js 1. 

Thus, the set of £ for which (2, t) € P{x ) is cither [0, a], or 0. The set {t : (3, t) € P(x )} 
is described in a similar fashion. 

Next we construct an embedding of T into a Banach space X, for which P(xo) = S. 
Suppose first ^ a ^ /3 < 1/3, and construct an embedding of T into Li(0, 2) with the 
property that P(xo) = (1, [a, /?])• Let c = 1 — 3/3, and a = (1 — 3a) /c. Define the functions 
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hih,h as follows: 



1 0<M^1 

l<tt<2 


f -a 

: / 2 («) = < 

[ l-ac/2 


^ u < c/2 

c/2 < u s$ 1 

1 < u sC 2 





fl^w^l-c/2 




—a 


l-c/2 <u^l . 




-(1 -ac/2) 


1 < u < 2 





/s(«) = 

Note that fifj ^ for z ^ j, hence ||i/j — sfj\\ = t + s for positive £ and s. Therefore, the 
mapping (i, t) i— > £/, describes an embedding of T into Li(0, 2). 
The barycentcr xq corresponds to the function g, given by 

-(o-l)/3 « G [0, c/2] U (1 - c/2, 1] 
5 (u) = { 1/3 c/2 < u ^ 1 - c/2 

1 <u sC 2 



Then 



and 



/ a — 1\ 2, . 1 — ac 

ll/i - sll = (l + —)c + 3 (1 - c) = 1 - —3— = l-a, 



ii <• M n - M c / a— 1\ 1 , c a - 1 / 



ac\ 
~2~) 



3 

By Proposition 4.1(3), P(^o) consists of all points xo G T such that d(xi,xo) ^ \\xi — xo\\ 
for i G {1, 2, 3}; that is, of all the points (1, t) with a ^ t ^ /3. 

To obtain S = (io, [0, /3]) U ( Uj^j (i, [0, a])) we modify the above construction, by 
setting c = 1 — 3/3, and a = (1 + 3a) /c. Then ||/ 2 — ,g| = I./3 — ,g| = 1 + /3, and 
||/i-.g|| = l + (ac-l)/3=l + a. 

A modification of this construction works in the "limit" case of j3 = 1/3. In this case 
embed T into M([0, 2]) (the space of regular Radon measures on [0,2]). As before, let 
Mi = h = X(o,i)j and set 

/i 2 = aS a + (1 - a)8 2 , [12 = a5i - (1 - a)S 2 , 

with a G [0, 1] to be determined later (here, S x is the Dirac measure supported by x). 
Once again, it is easy to check that the map (i,t) <-> t/ii defines an embedding of T to 
M([0,2]). The barycenter io corresponds to the measure 

v = -(a(5 + £i) +X(o,i) y 

hence 

11 2 , \ -, 1 ~ 2a j n 11 4 

\m -v\\ = 3 (1 + oJ = 1 — , and \\fi 2 - v\\ = \\fi 3 -v\\ = -. 

To obtain P(s ) = (1, [a, 1/3]), seta = (l-3a)/2 (then (l-2o)/3 = a). To end up with 
S = (1, [0,1/3]) U (U, ¥ i (z, [0,a])), set 0= (3a+l)/2. ■ 

Proposition 4.9. Suppose a metric tree T is embedded isometrically into Li(fi), x\,. .., 
x n are points of T , and a\ . . .a n are positive numbers, satisfying ^2 k a& = 1. If Xq G T 
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belongs to P(x( a '), then the unique point of con(xi, ... ,x n ), nearest to xo also belongs 
toP(x^). 

Proof. Let S = con(xi, . . . , x n ). Suppose xo £ P(x^ a '), or equivalently (Proposition 4.1), 
\\y — Xo|| ^ \\y — x^ || for any y £T. Only the case of xo £ S needs to be studied. Pick 
x £ S, and let x' be the point of [xo,x] with the property that d(xo,x') = inf{d(xo,y) : 
y £ [xo, x] n S}. In other words, x' is the point of [xo, x] H S, farthest from x. The set S 
is closed, hence x' € S. 

We claim that, for any u £ S, x' £ [xo,it], and consequently, d(xo,u) = d(xo,x') + 
d(x',u). Indeed, suppose, for the sake of contradiction, x' ^ [xo,u]- Then there exists 
z £ [xo,x']\{x'} such that [x',u] = [x', z]U[z, u\. By convexity, z £ S, which is impossible, 
by the definition of x'. 

Next, we show that x' £ P(x( a )). By Proposition 4.1, it suffices to show that, for any 
y £T, \\y — x'|| ^ \\y — x^\\. We consider two cases: 

(1) [x', y] n S is strictly larger than {x'}. As S is closed and metric convex, [x',y]nS = 
[x', z], for some z. We know that [xo, z] = [xq, x'\ U [x', z], hence [xo, y] = [xq, x'\ U [x', y\. 
Then d(x',y) = d(xo,y) — d(xo,x'), and therefore, 

d(x',y)^d(x ,y)^\\x^-y\\. 

(2) [x', y]nS = {x'}. In this case, note first that, for any u £ S, x' £ [y, it], and conse- 
quently, d(y,u) = d(y , x') + d(x' , it) . Indeed, if x' ^ [y,u], then there exists z £ [y,x']\{x'} 
such that [x',it] = [x',z] U [z,it]. Then z £ S, which contradicts our assumptions about 

V- 

Now recall that the ambient space is Za(f2, n). We can assume that x' = 0. Then, for 
any u £ S, \\y — u\\ = \\y\\ + ||it||, hence yu ^ /i-a.c. (we view y and it as functions on Q). 
As Xi, . . . ,x„ £ S, we also have x^y ^ /i-a.c.. Therefore, ||y — x(")| ^ ||i/|| = rf(x', y), 
which is what we need. ■ 

Example 4.10. Proposition 4.9 doesn't hold for embeddings into arbitrary spaces. Con- 
sider the "spider" T = {(i,t) : i £ {1,2,3,4},0 < t < 1}, as in Example 4.6. Embed T 
into^ by setting (l,i) *-> {-e 1 +e 2 + e 3 )t, (2,t) i-> (ei-e 2 + e 3 )i, (3,t) h^ (ei+e 2 -e 3 )£, 
and (4, £) n- (ei + e 2 + e 3 )£, (ei, e 2 , e 3 denote the canonical basis in l^). For i = 1,2, 3, 
let Xj = (i, 1). Then the "linear" barycenter of {xi,x 2 ,x 3 } is xq — (ei + e 2 + e 3 )/3. As 
this point lies on the image of T in l^, P(x ) = {(4, 1/3)}. This example also shows that 
the converse to Corollary 4.2 doesn't hold. 

Finally, we present an example suggesting that nothing non-trivial can be said about 
the distance from the "linear" barycenter of a tree to the tree itself. 

Example 4.11. Consider a "spider" T with n limbs of length 1, that is, the set of points 
(i, i), with 1 ^ i ^ n and ^ t ^ 1, with the usual radial metric. For 1 ^ i ^ n let 
x, = (i, 1). Then there exists an embedding of T into Li(l, n+1) such that ||x — xq\\ ^ 1 
for any x £ T. Indeed, the embedding taking (i,t) to X(i,i+t) nas the desired properties. 

5. Type, cotype, and convexity of metric trees. In this section we consider prop- 
erties of metric spaces, such as the four-point inequality and Rcshetnyak's inequality (see 
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Definition 1.10), type, and cotype. The notion of metric type was introduced in [12] (see 
also [45]). More recently, metric cotype was defined in [40]. 

Lemma 5.1. The four-point inequality implies Reshetnyak's inequality. 
Proof. Suppose the elements x\,x 2 ,x 3 , x 4 of a metric space (X, d) satisfy 

d{x\,x 2 ) + d(x 3 ,x 4 ) ^ ma,x{d(xi 7 x 3 ) + d(x 2 ,X4), d{x\, x±) + d(x 2 ,x 3 )}, 
and show that 

d(x 1 ,x 2 ) 2 + d(x 3 ,x 4 ) 2 < d(xi,x 3 ) 2 + d{x 2 ,x 4 ) 2 + d(x 1 ,x i ) 2 + d(x 2 ,x 3 ) 2 . 
By scaling and relabeling, we can assume that 

d(xi,x 2 ) + d{x 3 ,x 4 ) = 1 < d(xi,x 3 ) + d{x 2 ,Xi). 
Let a = d(xi,x 2 ), b = d{x\,x 3 ). Then d(x 3 , X&) = 1— a, d(x 2 , x 4 ) = l—b, and furthermore, 
d(xi, X4) ^ \d(xi, X3) — d(xs, x±)\ = \a + b — 1|, and 
d(x 2 ,x 3 ) ^ \d(xi,x 2 ) - d(x ll x 3 )\ = \a- b\. 
Thus, it suffices to show that, for any a G [0, 1] and b ^ 0, 

a 2 + (1 - a) 2 ^b 2 + {l-b) 2 + (a + b- l) 2 + (a - b) 2 . 
The last inequality is easily verified. ■ 

Therefore, any metric tree is a CAT(O) space. Below we show that metric trees are 
"more convex" (that is, their moduli of convexity are larger) than those of "generic" 
CAT(O) spaces. 

Lemma 5.2. Suppose T is a complete metric tree. Then, for any R > and e G [0, 2R], 
cr M (R,e)^ l-e/2. 

Proof. Suppose a,x\,x 2 G T are such that max{d(a,a;i),(i(a,a;2)} ^ R, and d{x\,x 2 ) ^ 
Re. We have to show that d{a, m) ^ R — Re/2, where m is the midpoint of [xi, x 2 ]. Find 
y G [£1,2:2] such that [a, x{\ = [a, y] U [y, xi], and [a, X2] = [a,y] U [y,X2]. Relabeling 
if necessary, we can assume that y G [x2,m]. Then m G [y, Xi], hence R ^ d(a,xi) = 
d(a, m) + d(m, xi) > d(a, m) + Re/2. Thus, d(a, m) < i? - Re/2, m 

Definition 5.3. Suppose 1 ^ p < 2, and K > 0. A metric space (X, d) is said to have 
metric type p (or BMW type p), after Bourgain, Milman, and Wolfson, who introduced 
this notion in [12]) with constant K if, for any n G N, and any function / : { — 1, 1}" — > X, 
we have 

n 

J2 d(f(e),f(~e)) 2 ^Kn^- 1 / 2 >T ]T d(f(e), f(e^)) 2 , (5.1) 

eG{-l,l}" eS{-l,l}" l=1 

where (ei, . . . , e n )W = (e, . . . , £i-i, — £j, £j+i, . . . , e n ). On an intuitive level, we can think 
of the points /(e) as vertices of a "cube." Then the left hand side of (5.1) is the sum of 
the squares of the "diagonals" of this cube, while the right hand side involves its "edges." 
We do not quote the definition of metric cotype, due to space constraints. Instead, we 
refer the reader to [401. 
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Theorem 5.4. 1. Any metric space satisfying the four-point inequality has metric type 
2, with constant 1 . In particular, this result holds for metric trees. 
2. Any complete metric tree has metric cotype 2, with a universal constant. 

Proof. Part (1) was proved in [42]. For Part (2), recall that any L\ space has cotype 
2, with the constant y/2 (this classical fact can be seen, for instance, by combining the 
Khintchme constant from [51] with the basic properties of cotype, described in e.g. [30]). 
Therefore, by Theorem 1.2 of [40], L\ has metric cotype 2, with the constant 90-^/2- We 
have seen that any finitely generated metric tree embeds isometrically into i±, for some 
N. As the cotype passes to subspaces, any finitely generated tree must have metric cotype 
2, with constant 90-\/2~- Finally, metric cotype is a "local" property, hence any complete 
metric tree must possess it. ■ 

We next tackle the negative type of metric trees. Recall that a metric space X has 
negative type p {p > 0) if, for any X\, . . . ,x n G X , the n x n matrix (d(xi, Xj) p ) is condi- 
tionally negative definite. Recall that a Hcrmitian matrix A = (o,ij)2j=i) is conditionally 
negative definite if Y17j=i a ij^j ^ whenever the vector £ = (£i)i=i satisfies J2i & = 0- 
The notion of p-negative type is equivalent to p-roundness, see e.g. [22, 37]. Negative type 
is strongly related to positive definiteness of kernels, and to embeddability into L p -spaces 
(see e.g. Section 8.1 of [8]). 

It was shown in [29] that any metric tree has negative type 1. Therefore, it has negative 
type p for any p G (0, 1]. We shall show that a metric tree need not have negative type p 
for p > 1. More precisely, consider the "spider" T n , consisting of a central point, and n 
limbs of length 1 . 

Proposition 5.5. If p > 1, then T n fails to have negative type p for n large enough. 

Proof. Suppose n > c/(c — 2), where c = 2 P . Consider the subset of T n , consisting of 
the central point to, and the endpoints ii,... , t n . The corresponding (n + 1) X (n + 1) 
distance matrix is 

( 1 1 1 ... 1 ^ 

1 c c 

1 c c 



C = 



\ 1 



o / 



We shall show the existence of £ = (£,o , £,i , ■ ■ ■ , £, n ) such that £i + . . . + £„ = — £q, and 
(C£j£) > 0. Note that, for £ as above, D£ = 0, where the D is a matrix of all whose 
entries equal 1. Let 



1 



(C - cD) 



(a 

1 


1 
a 


1 



1 




. 1 

. 


\ 


1 





a 





. 




\ 1 











a 


) 



where a — c/(c— 1) < 2. It suffices to find £ = (£o,£i, 
and (A$, < 0. 



, £„) such that £i+. . . +£ ri 



<o, 
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An induction argument yields the determinant of this (n+ 1) X (n+ 1) matrix: det A = 
a n+l — na"^ 1 . Thus, A has n — 1 eigenvalues equal to 0, as well as non-zero eigenvalues 
Ai = a — ^/n, and A2 = a + \fn. The corresponding normalized eigenvectors arc rj 1 = 
(-•v/n, 1, • ■ • , l)/\/2n and if = (y/n, 1, . . . , l)/\/2n. Then, for any £ € ^, 

(Ae,0 -A 1 |(^. ? 1 )| 2 + A 2 K£,r 7 2 )| 2 . 

Now consider 77 = (\/2n, — v2n, 0, . . . , 0). Then (^n 1 ) = — V" ~ 1: and (^V 1 ) = V^ ~ 1- 
Therefore, 

(A£, £) = (a - V™)(V™ + I) 2 + (a + v^XV^ - I) 2 = 2a(n + 1) - 4n, 

which is negative, by our choice of n. m 

Finally, we note that all metric trees have Markov type 2 [41]. 

6. Entropy Quantities and Other Measures of Compactness. 

6.1. e-entropy and related quantities. Kolmogorov introduced the notion of e-entropy as 
a measure of the massiveness of sets [34] . This notion has been useful in function spaces 
(see [24]), especially with asymptotic distribution of eigenvalues of elliptic operators, or 
as a way of measuring the sizes of spaces of solutions to PDE's [Hi]. Recently entropy 
and n- widths has been utilized as a measure of efficiency in the task of data compression 
(see [21], [46], [20]). In this section we examine the notion of entropy and its connection 
to the fact that complete metric trees are centered (Theorem 6.2). This proves the useful 
fact that the e-entropy of a bounded subset A of a tree T is equal to the e-entropy of A 
relative to (T,d). We also connect the covering numbers of a compact subset of a tree 
with these of its convex hull (see e.g. [14] for some Banach space results in the same vein). 

Definition 6.1. Suppose A is a subset of a metric space M. 

• A is centered if for all U C A such that diam (U) = 2r, there exists a E A such that 
U C -B c (a; r). By B c (a; r) we mean the closed ball of radius r centered at a. 

• {m^iei C M is an e-net for A in M if 

Ac \jB(mi;e) 

iei 

• {U a }aei, where U a C M, is an e- cover for A if diam(C/ Q ) ^ 2e and 

Ac\JU a . 

a£l 

• U C A is a e-separated subset of A if 

e ^ XiXj for all i,j £ I with i ^ j, Xi 7^ Xj € U. 

Let Af e (A) (ICf 1 (A)) be the cardinality of a minimal e-cover of A (respectively minimal 
e-net for A in M). Define Ai e (A) as the maximal cardinality of an e-separated subset of 
A. 

Note that, if A is a complete metric tree, then it is injective, hence K.f*(A) = K,f(A) 
for any ambient space M. 
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Theorem 6.2. Every complete metric tree T is centered. 

Very few spaces are centered. A typical example of a space which is not centered is 
R 2 . This can be seen if one tries to locate a center for an equilateral triangle of side length 
2r so that its distance to all points is at most r. 

For the proof we require a lemma. 

Lemma 6.3. Let A be a subset of metric tree T with diam(A) = 2r. Then for all e > 
exists 77i G con(A) such that A C B(m; r + e). 

Proof. For all e > 0, there exists x, y G A such that xy > 2r — 2e and let m be the 
midpoint of [a;,y]. 

Let z G A, then by the three-point property of metric trees, there exists w G [a;, y] 
such that [z, x] PI [z, y] = [z, w]. Without loss of generality we can assume that m G [w, x] 
and hence w G [z,x] by transitivity. Next diam (A) = 2r and w G [z,x] imply that 

icy , . 

2r ^ zx ~ zm + mx = zm + — > zm + (r — e), 

which implies r + e > zm. Therefore, A C B(m; r + e). ■ 

Proof of 6.2. Let £/ be a bounded subset of a metric tree T, and let diam ([/) = 2r. 

For all n G N, there exists x n , y n G C/ such that x„y„ > 2(r — n^ 1 ). Let z n G T be 
the midpoint of [x„, y n ], and we claim {z n } is a Cauchy sequence. 

Let < 2N~ l < e with N G N, and then let n,m ^ N. Let u G [x n ,y n ] be such 
that [z m ,u] = [z m ,x n ] n [z m ,y„], by swapping x n and j/„ we can claim without loss of 
generality that u G [z n ,y n ]. Therefore, z n G [x n ,z m ]. 

Since metric segments are closed under intersections, z m G [x m , y m ], and z m is an end 
point of [xn,Zm], we have that [x n ,z m ] n [x m , y m ] = [z m , v] where w G [x m , y m ]- Hence, by 
switching x m and y m we can claim without loss of generality that v G [x m , z m ]. Therefore 

Zjn G [X n , ym\ ■ 

Since diam([7) = 2r, cc„,y rn G C/, z„ G [x n ,z rn ] and z m G [x n ,y m ], we have 
z7* ^5 x n y m x n z n -\- z n z rn -\- z 7n y rll 

> (r - tiT 1 ) + z„z m + (r - 777" 1 ) ^ 2r - 2iV~ 1 + z„z m . 

Therefore, z n z m < 2N" 1 < e and {z„} is Cauchy. M is complete so let lim n z„ = z. 

Suppose that there exists u G U such that zu > r + 2e for some e > 0. Since lim n z n = 
z, we can find a n such that n~ 1 < e and zz n < e. Furthermore, by the proof of Lemma 6.3 
we know that z n u < r + n^ 1 < r + e. Hence, by the triangle inequality, zu ^ zz n + z n u < 
r + 2e, which contradicts that zu > r + 2e. Hence, U G B c (z; r) and therefore, T is 
centered. ■ 

Remark 6.4. Alternatively, one can prove Theorem 6.2 by recalling, from Theorem 2.3, 
that any complete metric tree is hyperconvex. By the definition of hyperconvexity (Defini- 
tion 2.1), any hyperconvex set is centered. However, our proof relies only on the properties 
of the metric segments, and thus sheds more light on the local property of trees. 

Theorem 6.5. If A is a subset of a complete metric tree T, then lC^(A) — Ai e {A). 
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Proof. Any complete metric tree is centered. Thus, any e-net for A is equivalent to an 
e-cover. ■ 

Next we connect the covering numbers J\f e (S) of a compact subset S of a tree T with 
those of its convex hull (see e.g. [14] for some Banach space results). 

Theorem 6.6. Suppose S is a compact subset of a complete metric tree T and £1,62 are 
positive numbers. Then 

A4 1+ e 2 (con(S)) ^ N ei (S) [diam S/(4e 2 )] . 

Proof. For the sake of brevity, set N — N ei (S), D = diam (S), and 5" = con(S). Convex- 
ity of the norm (see [50]) implies that the diameter of S" equals D. By Theorem 6.2, there 
exists xo <G S 1 such that for any y £ S", d(xo,y) ^ diam(S")/2 = D/2, and moreover, 

S> = U ^o,^]- 
xes 
Find £1,... , xjv £ T such that for any x £ 5 1 there exists i with the property that 

d(x,X{) ^ ei. Let x' t be the point of 5" which is closest of X{. Then d(x' i ,y) ^ d(xi,y) 

for any y € 5'. Indeed, there exists z £ [x^y] such that [xj,x^] = [xi,z] U [z,a^]. By 

convexity, z £ S', hence z = x' iy which is what we need. 

Now let K = |\D/(4e2)l • For each i, find the points {yij)f = i on [jcq, xj] in such a way 

that d(x ,yii) < £2, d(x' i ,y iK ) < e 2 , and d(y i3 ,y itj+ i) < 2e 2 for 1 < j < if. In total, we 

have iViC points y^. It remains to show that, for any y £ S, d(y,yij) ^ ei + £2 for some 

As we have observed, there exists x £ S such that y £ [xq,x]. Find i such that 
d(a;, x'A ^ ei. By Corollary 2.5 of [50], there exists z £ [xq^'/] such that d(z,y) < ei. 
Furthermore, there exists j such that d(z,yij) ^ £2- By the triangle inequality, 
d{y,Vij) < ei +e 2 . ■ 

6.2. Kolmogorov numbers. Kolmogorov introduced the notion of diameters (or widths) 
to generalize many of our intuitive ideas about "flatness" of compact subsets of linear 
spaces. Since then, Kolmogorov diameters have been widely used in approximation theory 
(see [44] and references therein). On the other hand, the notion of the measure of non- 
compactness of a subset of a metric space was introduced by Kuratowski [35] as a way 
to generalize Cantor's intersection theorem. In 1955, Darbo [17] applied measures of 
non-compactness to prove a powerful fixed point theorem. Since then measures of non- 
compactness have been a standard notion in fixed point theory. In the following, we define 
these two concepts and show the connections between them. 

Definition 6.7. Given a subset A of a normed linear space X and n ^ 0, define the n-th 
Kolmogorov diameter (n-width) of A in X as: 



5 n {A, X) = 6 n (A) := inf < sup d(a, M) \ M is a n-dimensional subspace of X 
The n-th afflne Kolmogorov diameter of A in X is defined as: 
5^ (A, X) = Sfi (A) := inf I sup d(a, M) \ M is an affinc subspace of X, dim M < n 
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Observe that the sequences {5 n (A)} n a =1 and {5 n a '(A)} n ° =1 are non-increasing, and 

5 n (A) > 6 n a \A) > 5 n+1 (conv(Au(-A))). 

Indeed, the left hand side inequality is obvious. To establish the right hand side, suppose 
5n (A) < c. Then there exists an afRne subspace M, of dimension not exceeding n, such 
that for any a £ A there exists m £ M satisfying \\a — m\\ < c. Any x £ AU (—A) can be 
expressed as x = J2i=i a i a i? with J^V \ai\ ^ 1, and <Zj € A. For each i, find mj € A/ such 
that ||aj — mi|| < c. Then M' = conv (M U (— M )) is a linear subspace of dimension not 
exceeding n + 1, m = J^ oumi £ M', and ||x — m|| < c. 

Furthermore, if A is centrally symmetric about in X, then S n (A) = S n (A). Indeed, 
fix e > 0, and find an affinc subspace Mclof dimension not exceeding n. such that 
for any a £ A there exists m £ M with the property that ||a — m|| < 8j? (A) + e. By 
symmetry, for such an a we can also find m_ £ —M satisfying ||a — to_| < Sn (A) + e. 
Note that m! = (to + to_)/2 £ M' = M + (— M), and the latter is a linear subspace of 
X, of the same dimension as M. By the triangle inequality, ||a — to'| < Sn (A) +e. Thus, 
8 n (A) ^ 5n (-A) + e. As e > is arbitrary, we are done. 

Definition 6.8. Suppose A is a subset of the metric space M. Define the ball (Hausdorff) 
measure of non-compactness and the set measure of non-compactness as 

n 

/3(A, M) := inf {b > | A C [J B(mf,b) for some rrij £ M}. 

3=1 

and 

k 
a(A) := inf {a > | A C [J A,- for some j4j C A with diam(A,) s^ a}, 

respectively. 

Note that /3(A, A'/) is an "extrinsic" measure of non-compactness, and may depend 
on the ambient space M . On the other hand, a(A) is intrinsic, and is independent of M. 
It is easy to observe that p(A, M) ^ a(A) ^ 2/3(A, M). 

Connections between entropy of linear maps, their Kolmogorov numbers (and other 
s-numbers), and their analytic properties, such as eigenvalues and essential spectrum, 
have been studied extensively (see [15] and references therein). Below we present some 
results illuminating the connections between Kolmogorov numbers and entropy properties 
of metric spaces. 

Theorem 6.9. Suppose A is a bounded subset of a Banach space X . Then 

lim S n (A,X)=p(A,X)= lim S n a} (A, X). 

n— >oo n— >oo 

Proof We only show the equality involving S n (A,X) = 8 n (A,X), as the one with 
5 n a (A, X) is handled in a similar manner. By the boundedness of A, {5 n (A, X)} n c L 1 
forms a non- increasing sequence of nonnegative numbers, hence linin^oo 5 n (A, X) exists. 
(1) (3{A,X) ^ lining S n (A,X). Pick c > b > lim n S n {A,X), and show /3(A,X) ^ 
c. Thus, there exists n such that S n (A,X) < b. This means there there exists an n- 
dimcnsional subspace E of X such that sup aS ^ d(a, E) < b. Let Q — {e £ E : \\e\\ ^ b + 
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sup a£j4 ||a||}. By compactness, Q contains a finite (c — 6)-nct {q n }. Then A C U n B(q n , c), 
hence 0(A,X) < c. 

(2) 0(A,X) > limn^oo <5„(A,X). Let 6 > 0(A,X), and show that <5„(A, A) < b. Find 
a finite 6-net {xj}J =1 C X for A. Then the dimension of E = span({xj}™ =1 ) does not 
exceed n. For any a G A, dist(a, E) ^ mim, \\a — .t.,- | < b, hence <5„(A, X) < b. m 

COROLLARY 6.10. Suppose a complete metric tree T is embedded isometrically into a 
Banach space X, and A is a bounded subset ofT. Then 

lim 5 n (A,X) = lim 5^(A t X) = ^-. 

Proof. Clearly, a(A) < 20(A,X). By Theorem 6.2, a (A) = 20 (A, T) ^ 20(A,X). Thus, 
a(A) = 20(A,X). An application of Theorem 6.9 completes the proof. ■ 

Next we consider affinc Kolmogorov diameters of V(S), where V is an embedding of a 
metric space S into a Banach space X. It is well known (see e.g. [15, 44], or Remark 6.13 
below) that Kolmogorov diameters may depend heavily on the ambient space X. If X is 
a subspace of Y, then 6 n (V(S), Y) ^ S n (V(S),X). Furthermore, if X is contained in a 
A-injcctive space Z, then 5 n (V(S),Z) < X8 n (V(S),Y). Similar inequalities hold for Sn . 

Suppose now that V is an isometric embedding of S into a 1-injective Banach space 
X. Then, by Theorem 3.2, 5 { n a \v(S),X) < d n (S), where d n (S) = <5, ( r ) (C/(S*),£ o(/:)) (the 
universal embedding U : S — > ^oo(£) was defined in Section 3.1). 

Proposition 6.11. Suppose S is a metric space. Let 

d = inf{e > | Kf(S) ^ n}, and c 2 = sup{e > | M € (S) > n + 1}. 

1. Suppose X is a Banach space, and V : S — > X is a 1-Lipschitz map. Then 
6 { n a) (V(S)),X) < ci. Thus, d n (S) < ci. 

2. d n (S) > ca/2. 

Proof. (1) Fix c > Ci, and suppose si, . . . ,s„ is a c-net in S. Let -E be the affine span of 
V(si), . . . , V(s n ). Then dimE ^ n. Furthermore, for any se5, 

d(V{s),E) ^ mind(V(s), V(si)) ^ mmd(s,Si) < c, 

which shows that ^ a) (V"(S')),X) ^ c. As c> ci is arbitrary, 5i a) {V (S)) , X) < cj. 

(2) Let M = n + 1. For c < C2, let si, . . . , sm be a c-separated subset of S 1 . Then 
there exists sq G T such that d(so, s») > c/2 for any i. Indeed, by relabeling if necessary, 
we can assume that d(s\,S2) ^ d(si,Sj) whenever i and j are different. Let sq be the 
midpoint of [si, s 2 ]. We claim that d(so, Si) > c/2 for any i. The inequality clearly holds 
for i G {1, 2}. If i > 2 and d(s , s,:) < c/2, then 

d(si,Si) < d(si, s ) + d(s ,Si) < — ^— + - < d(ai, s 2 ), 

a contradiction. 

Let £ be the set of 1-Lipschitz functions from T to R, taking s to 0. For any a = 
(cti,...,ctm) G {-1,1} m , define g a : {so,si, . . . ,sm} -> {-c/2, c/2} C K by setting 
,9o-( s o) = 0, and g a {si) = 0ic/2 for i ^ 1. Clearly <7 CT is 1-Lipschitz. By the injectivity of 
R, it has an extension h a : S — s> R, belonging to £. 
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Let L = 2 M , and identify {-1, 1} A/ with {1, . . . , L}. Define the map A : 4o(£) -*• £^ : 
(aii)hec —> ( a h„)a=i- Clearly, this is a linear contraction, hence d n (S) ^ Sn (A o U(S)). 
Moreover, U(si) = (h(si)) he c- For 1 ^ i ^ M consider e; =Ao U(si) = (h a (si)) a G £^. 
For any real numbers a%, . . . , a a/, 

|| y~V t e.tj| = max|^a ? /i CT (sj)| = - ^|«i|. 

i i i 

Therefore, e\, . . . , eju are linearly independent. Moreover, for x G span[±ei, . . . , ±cm], 
||x|| ^ c/2 if and only if x G C, where C = conv(±ei, . . . ,±eA/)- In other words, 
C is the ball of a M-dimensional subspace of £^, of radius c/2. By Lemma 2.C.8 of 
[38], Sk{C, £^ ) = c/2 if k < M. The set C is centrally symmetric, hence <5 fc (C,.^) = 
S k a) (C,£^). As n = M- 1, we conclude 

d„(5) > tfW^oCTCS)) > ^)(C,4) ^ <5„(C,^) = |. 

Since this inequality is valid for any c < C2, we are done. ■ 

As an application, we estimate d n (T) for finitely generated trees. That is, suppose 
T arises from a weighted graph theoretical tree T (see Example 1.7). For such a tree, 
denote by |T| the sum of weights (lengths) of the edges of the original graph. 

Corollary 6.12. There exist < c\ < C2 with the property that, for any finitely gen- 
erated tree T, there exists N — N(T) G N such that the inequality ci|T|/n ^ 5 n a (T) Sj 
C2\T\/n holds for any n ^ N = N(T). 

In fact, one can see that N(T) depends on the minimum of lengths of the edges of T. 
Remark 6.13. For a metric tree T, we have no good estimates for iniv,x S n a (V(T), X), 
where the infimum runs over all isometric embeddings V of T into a Banach space X. As 
we are interested in the infimum, we can assume that X = £^{1), for some index set /. 
For certain trees T and isometric embeddings A, S n (V(T), X) can be much smaller than 
d n (T). For instance, pick N G N, and let L = 2 N . Consider a "spider" T with L limbs of 
length 1. More precisely, T consists of the "root" o, and the pairs (i, t), with 1 ^ i ^ L, 
and < £ Sj 1. For convenience, we identify o with (i,0). The metric on T is described 
in Example 1.8. We can embed T into £^ isometrically: let e\, . . . , ex be an enumeration 
of the vertices of the unit ball of £^. Then the map V : T —> £^, taking (i, t) to te,, is 
an isometric embedding. Therefore, S n a (V(T),£^) = for n Js N . On the other hand, 
T contains a 2-scparated set of cardinality L — 2 N (the endpoints of the limbs of T). By 
Theorem 6.11, d n (T) ^ 1 for n < L - 1. 
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